Protein Synthesis- Degradation, 
A Stochastic Approach-part I 

Abstract 

In this work, we study a protein synthesis degradation process 
by defining a general mathematical model. Using generating func- 
tion technique we present a method that allows exact calculation of 
joint probability distribution of protein numbers for a two dimensional 
birth-death process with interaction. We discuss the model in steady 
state for a particular choice of states and transition rules and find 
exact solutions. 

Keywords: protein synthesis degradation, birth death stochastic process, 
analytic calculations, generating function. 
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1 Introduction 

Proteins are essential macromolecules that serve both as structural compo- 
nents of the cell and as its enzymatic machinery. When stochasticity in this 
processes is ignored the deterministic Michaelis-Menten model which can be 
understood also as a mean field approach is a good approximation. In most 
cases stochasticity plays a significant role in the process that can not be ig- 
nored, (ref Shai PNAS 2008). For example gene expression in both prokary- 
otes and eukaryotes is inherently stochastic [1, 2, 3, 4]. The regulation of 
genes by transcription factor proteins is a stochastic process as well, owing 
to the small numbers of copies of molecules involved. With the development 
of imaging techniques in molecular biology, we are able to observe directly 
the fiuctuations in the concentrations of proteins and mRNAs and, by mea- 
suring the intensity profiles of fiuorescence markers, measure full probability 
distributions [5,6]. Such stochasticity is both controlled and exploited by the 
cells and, as such, must be included in models. 

Protein synthesis-degradation is a tightly gene expression regulated cellu- 
lar process that affects growth, reproduction, aging and survival in response 
to both intrinsic and extrinsic cues, such as nutrient availability and en- 
ergy levels. The turnover of these proteins (synthesis and degradation) is a 
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stochastic process that plays a critical role in all biological processes. 

Most of the modeling activity of the last decade regarding protein-protein 
interactions has centered on stochastic simulation of individual realizations, 
i.e., Monte-Carlo methods or on various numerical approaches [7-17]. How- 
ever numerical simulations are naturally limited to a specifc choice of pa- 
rameters, and changing the parameters requires a completely new calcula- 
tion. Furthermore they suffer from the curse of dimensionality: the com- 
putational runtime grows prohibitively as the number of species increases. 
These problems can be bypassed by developing analytical approaches, which 
often require certain approximations. 

The aim of this work is to explore minimal models of protein synthesis- 
degradation and gain some analytical insight. For clarity of exposition, in 
next section we present the analytic tools used to find the steady state prob- 
ability distributions of protein copy numbers in the cell and introduce the 
model in the general form. After that we are illustrating several concrete 
cases for which analytic progress can be made either instead of numerical 
computation or right before turning to numerical approaches. 

Given that biochemical processes frequently involve small numbers of 
molecules (e.g. a few molecules of a transcriptional regulator binding to 
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one 'molecule' of a DNA regulatory region) and such reactions are subject 
to significant stochastic fluctuations we consider that the minimal models 
considered here are relevant to such context and therefore important to be 
studied at analytical level. 

2 Methods and general framework of the model 

We envision a protein synthesis degradation process as a continuous in time 
birth death Markov process with a discrete (very large) state space. When 
two different species or types of proteins with a large number of possible 
states are involved the stochastic model that describes the process can be 
considered a two dimensional birth death Markov process. The interaction 
between the two protein types, I and II is "felt" in the creation rate of the 
protein type II as described bellow. The master equation gives the flow for 
7r(j, k; t) — TTjk, the probability of there being j copies of the 1st species, k 
copies of the second, at time t. For a simple case where each reaction either 
creates or annhilates one and only one component, and for the simple case 
where birth/decay rates are constant, we have the following master equation 
describing the time evolution of the probability distribution 
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dTTjk 



j=i k=i 



+ ^jk-ibjk - T^jkj^ - T^jkkd) 



oo oo 



(1) 



where: irjk is the joint probabihty distribution of type I and II to have j 
respective k quantities; S and d are the rates of annihilation of protein type 1 
respectively 2 which are some constants proportional to the existing number 
of protein copies. The rates of creation for protein type 1 with j copies is (3 
and for protein type 2 with k copies is bjk,- 

The two protein types interact in the following way: when protein type 
1 gets to a certain threshold 9, protein type 2 is changing the creation/birth 
rate according with bellow step function: 



We will study analytically the stochastic model formulated above in the 
steady state case for a particular choice of states and rules of state transitions. 

Solving master equation (1) analytically for the long time behavior of 
TTjk is generally an impossible task when the state space is very large. One, 
therefore, has to resort on various techniques. One such technique often 
used successfully in stochastic processes literature is the "generating function 
technique" [18-20]. We remind the reader of some well-known aspects of this 




bo, when j < 6 



bi when j > 6 
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technique in order to make the present discussion self-contained. 

Assume j is a discrete random variable and assume, for convenience, the 
state space is {0, 1, 2...}. Let ttj be the probability mass function of j where 
Yl'^o'Kj — 1; the mean of j satisfy: /ij — E{j) — Y.'^oj'^j- The probabihty 
generating function (p.g.f.) of the discrete random variable j is defined by 

oc 
3=0 

for some xeR, because Z^^o'^^i — 1' above sum converges absolutely for 
\x\ < 1. As the name implies, the p.g.f. generates the probabilities associated 
with the distribution, where fj{0) = ttq, /j'(0) = tti, /"j(0) = 2\tt2, and in 
general /"(O) = n\pn- The p.g.f. gives entire information associated with the 
distribution. 

3 First minimal model considered: Two state 
model 

Given that real biological systems frequently involve small numbers of molecules 
we developed a minimal model where the second protein type can be in 2 
possible states: present or absent. An example of such situation in real life 
would be a genetic switch on/off. 
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The two protein types undergo a birth/death process with interaction. 
The creation and anihilation rates are the same as in general model with the 
following specifications: type I protein can have any number of copies/states, 
while type II protein can only have 2 possible copies/states: or 1, meaning 
we have no protein or just one protein. The interaction between the 2 protein 
types is as following: the creation rates of the second protein type, bjk, will 
directly depend by the number of copies of first protein type as bellow: 

bo, when j < 9 
hi when j > 9 
when k > IVj 

For simphcity we will chose 9 — 1 but the result can be easily generahzed, 
see fig. 2 (with arrows and transition rates) 

For this model, in steady state the master equation (1) is replaced with 
bellow equations, where we keep the same notation as in equation (1). 



bjk = < 



j = 0,k = 0: TTooiP + bo) = TTio6 + TToid, (2) 

j>l,/c = 0: 7rjo(i(^ + /3 + 6i) =7rj+i,o(i + l)5 + 7rjid + 7rj_i,o/9, (3) 

j = 0,A; = l: 7roi(^ + d) = 7rii(5 + 7roo6o, (4) 

j >l,k=l: -Kji{j5 + j3 + d)^ ^j+i,iU + + ^joh + (5) 

7 



Using generating function technique we simplify our problem by transform- 
ing the above equations into ODE's satisfied by the generating function. 
Let fk{x) = Yl'jLo'T^jkX^ be the probability generating function (p.g.f); the 



solutely for I a; I < 1. According with the above notation we have: /a;(0) = ttqa;, 



/o(0) = TToo, /i(0) = TToi, and /o(x) = Ej°lo /i(^) = T.f=QT^nxK 



Given that p.g.f. generates the probabilities associated with the distribu- 
tion as previously explained, the problem is to find an analytical expression 
for /o(0) since once we would know such expression, we have all the necessary 
information to know the joint probability distribution for the 2 protein types. 

Note that for the case when we have just one protein type undergoing 
a birth death process with a constant decay/birth rate is a well known 
fact that in steady state, its stationary probability distribution is a Pois- 
son distribution [19,20]. Since protein 2 doesn't influence protein 1, the 
marginal distribution of protein 1 is still given by the Poisson distribution: 
Pj — "^jo + TTji = ^1(1)^ e~l^/^ . It follows that marginal, in the generating 
function notation is correct to right: 



differential of p.g.f.is:/jj(a;) = Y^'^aj'^jk^- 



^; where Y^JLo'^jk^^ converges ab- 



oc 00 
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00 1 
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^ g(x-l)/3/5 



(6) 



and 



foi^) + f[{x) = ^e(^-'^^/' (7) 



Using generating function technique on equations 2,3 an ODE equation 
(eq.8) satisfied by a generating function is derived (see Appendix A for a 
detailed derivation) 



x5^Mx) + {P + h)Mx) + {bo-h)fo{0) = 6-^Mx) + Mx)d+Mx)Px (8) 



same procedure applied on eq 4,5 and obtain the following equation: 

x5-^h{x) + {/3+d)Mx) = (bo-h)fo{0) + 6^Mx)+Mx)b,+xf,{x)/3 (9) 

Steps toweird obtaining /o(a;) : Using condition (6) in eq.(8) I obtain a 
new equation (eq.lO)in the fo{x) as unknown which once solved gives me the 
expression for fo{x) generating function. 

{x-l)S-^fo{x) + {-^{x-l) + b, + d)fo{x) = de(^-iW^ + (6i-6o)/o(0) (10) 

This is a first order ODE; Using integrand factor method one gets after some 
calculations (see Appendix B for a detailed derivation in solving eq.lO) 



/o(x)(l-x)(''i+'^)/V3/^--/o(0) 
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bi + a 6 Jo 

(11) 

Setting a; = 1 in equation 11 one obtains /o(0) and than going back at eq.ll 
one gets an expression for fo{x) 



4 RESULTS 



Setting x = 1 in eq. (9) one obtains /o(0): 



/o(0) = 



(12) 



bi + d [i_h^j^ e-(/5/%(l - y)^~^dy_ 
Given that the derivatives of generating fct at zero gives the probabilities 
associated with the distribution, we have: 



TToo = /o(0) 
^20 = f^{0) 



Prom (6) we have fi{x) — e^^~^^^^^ — fo{x) and then for x = one obtains: 
A(0) = e-/^/^-/o(0) 



10 



therefore,in same way I can easily get: 

7roi=/i(0) 
7rii=/;(0) 

5 Discussions and conclusions 

Using the result above one can determine the joint probability distribution 
in the stationary state of having a given number of proteins type I and II 
in the system given that the protein type II can have just 2 possible states. 
Next I will expand this result for the case when protein type II can have 
more than two states involved in the process. I will show how one can 
use generating function technique on an expanded version of the 2 state 
model, when protein type II can have more than two states involved in the 
process. The complexity of the calculation that such model requires will help 
us conclude that numerical approximations in conjunction with the analytical 
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result obtained in the 2 state model are necessary if one wants to explore 
the model as described by master quation (1) for a general 2 dimensional 
birth/death process when the 2 protein types can have j respectively k copies, 
with j,k being any integers. Using generating function technique I've shown 
how one can get an analytical expression for joint probability distribution of 
the 2 protein types that undergo a birth/death process with interaction as 
described in the 2 state model. 

The analytical result obtained in the 2 state model can be used to help 
developing appropiate numerical methods for approximating results of the 
stochastic process when the second protein type has A; > 3 states. 
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Appendix 



A Apply generating function technique in two 
state model for obtaining an ODE satisfied 
by g.f.: eq. (8) 

Using generating function technique on equations 2,3 (in equation 2+3 mul- 
tiply by and sum over j from to oo each term) an ODE equation (eq.8) 
satisfied by a generating function is derived. 

In equation 2+3 multiply by and sum over j from to oo each term. 



^ X^i^jOa^^ + + bi)7ljQX^ + /^TToo + ^oTToo 



oo 

J 



j=l j=l j=l j=o j=o 



where each term in the equation can be written: 



oo oo ^ 

j=l j=o 
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II 



j=l j=0 



III 

^7roo = /3/o(0), 

IV 

bonoo = 60/0 (0) 

V 

00 00 

+ l)7r(j+i),o2;^ = + l)7r(j+i),o2;^ - Sttio 

3=1 j=o 

VI 

00 00 
j=l j=l 

VII 

00 

VIII (^TTio , IX dTToi 

introducing all terms from I- IX back in equation we obtain 
equation (8): 



^^j^U^) + (/5 + ^i)/o(^) + (^0 - 6i)/o(0) = Sj^hix) + h{x)d + /o(x)/3. 
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Similarly, applying generating function technique on eq 4,5 yields the 
following equation: 

(9) 

^5^/1 (a;) + (/3 + d)f,{x) = {bo - 6i)/o(0) + S^hix) + Mx)W + xh{x)P 

Further, using expression (6) in eq.(8) I obtain equation (eq.lO) for /o(a;) 
(10) 

{x - l)^^/o(^) + - 1) + 61 + d)fo{x) = de(-- W + [p^ _ ^,q)/q(o) 

B Derivations of eq.8 in two state model- 
Integrating factor method 

(8) 

{x - i)S^Mx) + {-/3{x - 1) + 61 + d)fo{x) = de(--i)^/^ + (61 - 6o)/o(0) 
multiply eq. (8) with the integrand factor: 



{x - l)dfo{x) + {-(3{x - 1) + 61 + d)fo{x] 
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where: 



/o {y - 1)5 

therefore: 



from here we get by dividing with [x — 1)5: 



f^^^)j: '-"^w^^y^j = l/{x-l)5 {de^^-'WS + {h - 6o)/o(0)) e-(^/^)-e«''^+'^)/^)'<'^(^--) 

Integrating this eq. we obtain: 

(10) 

fo{x)e^o ''%'-Z"'''y-f^^o) = - £ (de^y-'WS + (5^ _ bo)fo{0)) e-'^^/''>y {1/5)1/ {l-y)'^'-^''+ 
or equation. (11. a): 

/o(a;)(l - ^)(fi+<^)/5e-^/^- - /o(0) + h; 

where 

^1 ^/"io (^y - iY{-b+d)/5) + i^y 

^ _rfe-^/5_L^(l _ (1 _ ^f,+dys^ 



and 
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by introducing the expression Ii and I2 in equation 11 we obtain the 
expression for fo{x): 

[11] 

/o(x)(l-x)(''^+'^)/V^/^^-/o(0) = 

-de-^/^-L.(i - (1 - ,f^^<iys^ - - r e-Hl - yf-^-'^dy 

bi + a Jo 
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